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Sequential estimation of a probability p by means of inverse binomial sampling is considered. For 
MI1M2 > 1 given, the accuracy of an estimator p is measured by the confidence level P[p/pL2 < 
p<p/ii]. The confidence levels co that can be guaranteed for p unknown, that is, such that 
P[p/ < p < p/ii] > Co for all p £ (0, 1), are investigated. It is shown that within the general 
class of randomized or non-randomized estimators based on inverse binomial sampling, there 
is a maximum cq that can be guaranteed for arbitrary p. A non-randomized estimator is given 
that achieves this maximum guaranteed confidence under mild conditions on /ii, ^2- 

Keywords: confidence level; interval estimation; inverse binomial sampling; sequential 
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1. Introduction 

In a sequence of Bernoulli trials with probability of success p at each trial, consider 
the estimation of p by inverse binomial sampling. This sampling scheme, first dis- 
cussed by Haldane (1945), consists in observing the sequence until a given number r 
of successes is obtained. The resulting number of trials iV is a sufficient statistic for 
p (Lehmann and Casella (1998), page 101). The uniformly minimum variance unbiased 
estimator of p is (Mikulski and Smith (1976)) 

r - 1 

and for r > 3, it has a normalized mean square error E[(p — p)^]/p^ (or Var[p]/p^) 
lower than l/(r — 2), irrespective of p (Mikulski and Smith (1976); Sathe (1977); 
Prasad and Sahai (1982)). 

This paper analyzes inverse binomial sampling from a different point of view, related 
to interval estimation. Given /xi,/X2 > 1, the accuracy of an estimator p is measured 



This is an electronic reprint of the original article published by the ISI/BS in Bernoulli, 
2010, Vol. 16, No. 2, 493-513. This reprint differs from the original in pagination and 
typographic detail. 



1350-7265 © 2010 ISI/BS 



494 



L. Mendo and J.M. Hernando 



by the probabihty c that p hes in the interval [p/^2,PMi]- The motivation to use a 
relative interval [p/yLt2,p/xi], instead of an interval [2;i,a;2] with xi,X2 fixed, is the fact 
that P[p/ji2 <P< PMi], unlike P\xi <p< X2\, has a definite meaning independent of p. 
Moreover, using a relative interval allows another interpretation of c: since p/^2 ^ PMi 
if and only ii p/fJ-i <p < p^2, c gives the probability that the true value p is covered by 
the random interval [p//ii,p/i2]. In the sequel, c will be referred to as the confidence (or 
confidence level) associated with ^i, fj.2. 

Recently, Mendo and Hernando (2006, 2008a) have shown that the estimator 

r - 1 

has the following properties for r > 3. The confidence level c associated with fix, 112 has 
an asymptotic value c as p — > 0, namely c = ^{r,{i — l)/i2) — 7('", ('' ^ 1)/mi)i where 

t 



1 / - 

l{'i;t) = —— / s'-iexp(-s)ds 



is the regularized incomplete gamma function. Furthermore, the confidence for any p S 
(0, 1) exceeds this asymptotic value provided that /ii and ^2 satisfy certain lower bounds. 
Similar results have been established (Mendo and Hernando (2008b)) for the uniformly 
minimum variance unbiased estimator (1.1). Using a more general setting in which the 
Bernoulli random variables are replaced by arbitrary bounded random variables, Chen 
(2007) has obtained comparable (although somewhat less tight) results for (1.1), as well 
as for the maximum likelihood estimator 

P=^. (1.3) 

From the aforementioned results, the question naturally arises as to whether the at- 
tained confidence could be improved using estimators other than (1.1)-(1.3). This mo- 
tivates the study of arbitrary estimators based on inverse binomial sampling. In this 
regard, it is noted that 1 — c can be expressed as the risk E[L(p)] corresponding to the 
loss function L defined as 

L{x) = {^' ^^f ^[^/'^^'^'^il' (1.4) 
1^ 1, otherwise. 

Since is a sufficient statistic, for any estimator defined in terms of the observed 
Bernoulli random variables, there exists an estimator that depends on the observations 
through N only and that has the same risk; this equivalent estimator is possibly a ran- 
domized one (Lchmann and Casella (1998), page 33). (The fact that L is non-convex 
prevents application of a corollary of the Rao-Blackwell theorem (Lehmann and Casella 
(1998), page 48) to discard randomized estimators.) Thus, attention can be restricted 
to estimators that depend on the observed variables only through TV; however, both 
randomized and non-randomized estimators have to be considered. 



Optimum guaranteed confidence in inverse binomial sampling 



495 



The purpose of the present paper is to investigate the hmits on the confidence levels 
Co that can be guaranteed (in the sense that the actual confidence equals or exceeds cq, 
irrespective of p) in inverse binomial sampling. More specifically, the objectives are: 

• to determine the suprcmum of infpg(o,i) c over all (randomized or non-randomized) 
estimators based on inverse binomial sampling; 

• to find an estimator, if it exists, that can achieve this suprcmum. 

Consequently, the main focus of this paper is on non-asymptotic results, valid for p € 
(0,1). Nonetheless, asymptotic results for p — will also be derived since, apart from 
their own theoretical importance, they provide an upper bound on (and an indication 
of) what can be achieved for arbitrary p. 

Section 2 shows that lim mfp^Q c has a maximum over all estimators based on inverse 
binomial sampling and computes this maximum. This sets an upper bound on the confi- 
dence that can be guaranteed by an arbitrary estimator. Section 3 establishes that this 
upper bound is also a maximum, that is, estimators exist that can achieve this guaranteed 
confidence. Specifically, estimators are given that guarantee the maximum confidence for 
sufficiently small p and that guarantee the maximum confidence for any p, under certain 
conditions on the relative interval being considered. Section 4 discusses the results, com- 
paring them with those from other works. Section 5 contains proofs of all results in the 
paper. 

2. Asymptotic analysis 

It is assumed in the sequel that ?- > 3. Let t^-'^ denote t(i — 1) • • • (i — j + 1). The probability 
function of N, fin) = P[N = n], is 

f{n)= ^ /(1-p)"-'' forn>r. (2.1) 

As justified in Section 1, it is sufficient to consider (possibly randomized) estimators 
defined in terms of the sufficient statistic A^. Let J- denote the set of all functions from 
{r, r -|- 1, 7' + 2, . . .} to K. A non-randomized estimator p can be described as p = g{N)^ 
with g G J-. Thus, a non-randomized estimator is entirely specified by its function g. 
A randomized estimator is a random variable p whose distribution depends, in general, 
on the value taken by A^. Let n„ denote the distribution function of p conditioned on 
N = n. The randomized estimator is completely specified by the functions n„, n>r. 
Thus, denoting by J-'-jz the class of all functions from {r,r + l,r + 2, . . .} to the set of real 
functions of a real variable, a randomized estimator is defined by a function G G J-n that 
assigns n„ to each n. 

Non-randomized estimators form a subset of the class of randomized estimators. Thus, 
any statement that applies to randomized estimators will also be valid, in particular, 
for non-randomized estimators. The reason that the specialized class of non-randomized 
estimators has been explicitly defined is that, on one hand, their simplicity makes them 
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more attractive for applications and, on the other hand, it wiU be seen that, under certain 
conditions, no loss of optimality in guaranteed confidence is incurred by restricting to non- 
randomized estimators. Throughout the paper, when referring to an arbitrary estimator 
without specifying its type, the general class of randomized estimators (including the 
non-randomized ones) will be meant. 

Given any estimator G G T-n, the confidence associated with /ii, ^2 will be expressed in 
the sequel as a function c(jp) (that is, the dependence on p will be explicitly indicated). 
Given r, /ii and ^2, the latter function is determined by G. An estimator is said to 
guarantee a confidence level Cq in the interval (^1,^2) if c(p) > cq for all {pi,p2)- If 
the interval is (0, 1), then the estimator is said to globally guarantee this confidence level. 
An estimator asymptotically guarantees a certain cg if there exists e > such that the 
estimator guarantees cq in the interval (0,£). 

The problem being addressed can be rephrased as that of optimizing the globally 
guaranteed confidence within the general class of estimators based on inverse binomial 
sampling, or finding a minimax estimator with respect to the risk defined by the loss 
function (1.4). The following proposition, and its ensuing particularization, provides the 
motivation for studying lim inf p_>o c{p) as part of this optimization problem. 

Proposition 1. // a given estimator, with confidence function c{p), guarantees a con- 
fidence level Co in an interval {pi,P2), then, necessarily, liminfp_j.p,-, c(p) > cq for any 
Po e [pi,P2]- 

Particularizing Proposition 1 to pi =po = 0, it is seen that liminfp_j.o c(p) represents an 
upper bound on the confidence levels that can be globally or asymptotically guaranteed. 

An important subclass of non-randomized estimators is formed by those defined 
by functions g € T for which c{p) has an asymptotic value, that is, for which 
limp^o P[p / IJ'2 < g{N) <ppi] exists. The set of all such functions will be denoted J^p. 
As will be seen, another important subclass is that corresponding to the set of functions 
g € for which lim„^oo ngin) exists, is finite and non-zero. This set will be denoted Fn- 
The following result establishes that J^n C J-"p . 

Proposition 2. For a non-randomized estimator defined by g G J--a_, limp_i.oc(p) exists 
and equals c, given by 

c = 7(r, fi/i2) — 7(r, fi//^i), Q= lim ng{n). (2-2) 

The converse of Proposition 2 is not true; that is, T-a ^ T-p. A simple counterexample 
is given by 

/ N \ Lo' In, if there exists fc €E N such that n = 2*^, 
gin) = \ I , . 

yuj/n, otherwise, 

with Lo' ^ ijj. It is easily seen that this function is in Tp, with limp_j.o c{p) = 7(r, 10112) — 
^{r,uj / pLi); however, lim„_>.oo "(7(71) does not exist. 
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Given r, /ii and fi2, the maximum of c over all g G is attained when lim„_i,oo ng{n) 
equals ri*, given by 

^,^^ log/..-log(l/m) ^ (2.3) 

as is readily seen by differentiating c in (2.2). Let c* denote the resulting maximum. 
Defining 

M = ^ll^l2, (2.4) 

the terms W / jii and fi*/i2 can be expressed as 

rlogA/ rMlogM 
"/'^^^M^' "^^^ M-1 ^^-^^ 

and thus 

, / rMlogM\ / r\ogM \ 

The following theorem establishes that c* is not only the maximum of limp_5.o c(p) within 
the subclass of non-randomized estimators defined by J-n, but also the maximum of 
lim inf p_i.o c[p) within the general class of randomized estimators defined by JSi • 

Theorem 1. The maximum o/ liminfp_>.o c(p) over all estimators defined by functions 
G <E Fti is c* , given by (2.6). 

As can be seen from (2.6), c* depends on /^i and ^2 only through M . An explanation 
of this result is as follows. Given ni, fj,2, let p be an arbitrary estimator and consider 
a > 0. If pi and are replaced by /x'^ — a^i and — A*2/a, respectively, defining a 
modified estimator p' = ap, it is clear that p/ IJL'2 <p' < if ^-^d only ii p/^2 < P < Pf-i- 
This shows that any value of liminfp_>o c(j>) that can be achieved for ni, /Li2 can also be 
achieved for a/xi, 112 /a (using a different estimator), and conversely. Thus, c* is the same 
for /ii, fi2 and for a/ii, 112/0.- 



3. An optimum estimator for certain relative intervals 

The asymptotic results in Section 2 impose a limit on the confidence levels that can be 
guaranteed, as established by the following corollary of Proposition 1 and Theorem 1. 

Corollary 1. No estimator can guarantee a confidence level greater than c* , given by 
(2.6), in an interval (0,^2)- 

According to Corollary 1, c* is an upper bound on the confidence that can be guaran- 
teed either asymptotically or globally. It remains to be seen if there exists some estimator 
that can actually guarantee the confidence level c*. If it exists, that estimator will be 



498 



L. Mendo and J.M. Hernando 



optimum from the point of view of guaranteed confidence. A related question is if one 
such optimum estimator can be found within the restricted class of non-randomized es- 
timators. As will be shown, the answer to both questions turns out to be affirmative for 
all values of /xi and ^2 in the case of asymptotic guarantee and for certain values of ^1 
and ^2 hi the case of global guarantee. 

Consider a non-randomized estimator p ~ g{N) of the form 

n + a 

where fl and d are parameters (with D, = lim„_>.oo ng^n)). This is a generalization of the 
estimators (1.1)-(1.3). Note that (3.1) has an asymptotic confidence c given by (2.2) 
and, for it achieves the maximum lim inf c(p) that any estimator can have, 

according to Theorem 1. 

Under a mild condition on d, the estimator given by (3.1) with f2 = 51* can be shown to 
asymptotically guarantee the confidence c* for any fii, ^2 > 0, as established by Theorem 
2 below. 

Theorem 2. The non-randomized estimator 

n* 

p= (3.2) 

^ N + d ^ ' 

asymptotically guarantees the optimum confidence c* given in (2.6) if 

^>\i-r^'4^). (3.3) 



2V Af-1 



where M is defined by (2.4). 



An estimator of this form can also globally guarantee the confidence level c* , provided 
that /ii , /i2 are not too small, as discussed in the following. 

For d E Z, the estimator defined by (3.1) lends itself to an analysis similar to that 
carried out by Mendo and Hernando (2006, 2008a) for the particular case (1.2). This 
allows the derivation of sufficient conditions on fii, fi2 which ensure that c{p) > c for 
all p € (0, 1). The least restrictive conditions are obtained for d = 1 and are given in the 
following proposition, which, particularized to fl = fl* , will yield the desired result on 
globally guaranteeing the optimum confidence. 

Proposition 3. The confidence of the non-randomized estimator 

^ (3.4) 

^ N + l ^ ' 

exceeds its asymptotic value c, given by (2.2), for all p Cz (0,1) if 



(3.5) 
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Particularizing to = il* , Proposition 3 establishes that there exists a non-randomized 
estimator that can globally guarantee the optimum confidence c* for certain values of /Lti , 
/X2- It turns out that for O = $7* and r given, one of the two inequalities in (3.5) implies 
the other. Thus, for each r, only one of the inequalities needs to be considered in order 
to determine the allowed range for fj,i, fi2- The result is stated in the following theorem. 

Theorem 3. The non-randomized estimator 

$1* 

p=-^, (3.6) 

with f2* as in (2.3), globally guarantees the optimum confidence c* given by (2.6) if either 
of the following conditions is satisfied: 

f^>^ /o-e{3,4}; (3.7) 
log M 7' — 1 

— T- > for r > 5, 3.8 

M — 1 r 

where M is defined by (2.4). These conditions can be jointly expressed as M> h{r), 
where h is an increasing function. 

Given r, consider the region of (/ii,/i2) values that satisfy the appropriate condition 
(3.7) or (3.8), with ^i, ^2 > 1- From Theorem 3, the boundary of this region is a contin- 
uous, decreasing, concave curve in (l,oo) x (l,oo), namely, the curve determined by the 
equation /ii/i2 = h{r). The region in question is the union of this curve and the portion 
of the plane lying above and to the right. Furthermore, the region for r' > r contains 
that for r. 

According to the preceding results, the problem of optimum estimation of p, in the 
sense of globally guaranteeing the maximum possible confidence, is solved by the non- 
randomized estimator (3.6) for /ii, fi2 satisfying (3.7) or (3.8). Equivalently, this estimator 
is minimax with respect to the risk defined by the loss function given in (1.4) (maximin 
with respect to confidence). 

The fact that c* depends on /ii, fi2 only through AI gives rise to another interpretation 
of the result in Theorem 3. For r and M given, consider the problem of finding, among all 
interval estimators of p with a ratio M between their end-points, that which maximizes 
the globally guaranteed confidence. The solution, if M satisfies (3.7) or (3.8), is 



r log M rM log M 



{M -1){N + iy {M -1){N +1) 



(3.9) 



and the resulting maximum is c*, as expressed by (2.6). Equivalently, given r and a 
prescribed confidence cg, if a value for M is computed such that (2.6) holds with c* = cq 
and if it satisfies (3.7) or (3.8), then the interval estimator (3.9) minimizes the ratio 
between interval end-points subject to a globally guaranteed confidence level cq . Observe 
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that it is meaningful to prescribe, or minimize, the ratio of the interval end-points, rather 
than their difference, since a given value for the latter might be either unacceptably high 
or unnecessarily small, depending on the unknown p, whereas the ratio has a definite 
meaning, regardless of p. 

According to Proposition 3, conditions (3.5) are sufficient; however, they may not be 
necessary. The same applies to (3.7) and (3.8). Determining the most general conditions 
which assure optimality of (3.6) is a difficult problem.^ Nevertheless, as will be illustrated, 
the sufficient conditions (3.7) or (3.8) cover most cases of interest. 

4. Discussion 

Figure 1(a) depicts the relationship between c* , M and r, for M satisfying (3.7) or (3.8). 
The guaranteed confidence c* is represented, for convenience, as a function of \fM — 1, 
each dashed curve corresponding to a different r. The figure also represents, with solid 
line, the minimum c* that fulfills inequalities (3.7) or (3.8); this corresponds to the 
lowest r for which the applicable inequality holds. Figure 1(b) shows this minimum r as 
a function of \fM — 1, with c* as a parameter. From Corollary 1, this figure also has 
a more general interpretation as the minimum r that is required in order to guarantee 
(either globally or asymptotically) a desired confidence level using any estimator based 
on inverse binomial sampling. 

Given /zi, \ii and c, if the point {^/M — l,c) with M = //i//2 lies in the region above 
the solid curve in Figure 1(a), then there exist values of r for which the estimator (3.6) 
globally guarantees the confidence level c for the relative interval defined by /xi and 
^2'i the minimum such r is displayed in Figure 1(b). As mentioned earlier, the region 
referred to in Figure 1(a) (or, equivalently, conditions (3.7) and (3.8)) covers most cases 
of interest. For example, any confidence greater than 85% can be globally guaranteed for 
any relative interval with \fM — 1 < 1.108, that is, such that < 4.443. 

An important subclass of relative intervals is that for which /xi = = 1 + to, m > 0. 
An interval of this form corresponds to the requirement that p and p do not deviate 
from each other by a factor greater than 1 + m; the parameter m is thus interpreted as a 
relative error margin. The guaranteed confidence and required r in this case can be read 
directly from Figures 1(a) and 1(b), as \fM — 1 = m. This particular case is analyzed 
by Mendo and Hernando (2006, 2008a) for the estimator (1.2). Comparing Figure 1 of 
Mendo and Hernando (2008a) with Figure 1(a), the individual (dashed) curves in the 
latter are seen to be above those in the former (this is most noticeable for small r), in 
accordance with the fact that the estimator (3.6) is optimum. This yields a reduction 
in the error margin m for a given r and a desired guaranteed confidence. For example, 
taking r — 10, the estimator (1.2) guarantees a confidence level of 95% for m = 0.9074, 

^Although c(p) (or a lower bound thereof) can be expressed in terms of the Gauss hypergeometric 
function '2,F\(a, b; c; t) (see the proof of Theorem 2), standard algorithms for evaluation of hypergeometric 
sums (Petkovsek ct al. (1996)) arc not directly applicable because of the existing dependence between b 
and t. 
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whereas (3.6) guarantees the same confidence for m = 0.8808. Furthermore, the latter 
value is the smallest m for which a confidence level of 95% can be guaranteed by any 
estimator with the r in question. 

As another manifestation of the optimum character of the estimator (3.6), the curves 
in Figure 1(b) for ^/M — 1 = m are farther to the left than those in Figure 5(a) of 
Mendo and Hernando (2006). Since, from (2.1), E[iV] ^r/p, for certain combinations of 
m and c, this provides a reduction in average observation time to achieve a globally 
guaranteed confidence c for an error margin m. (The fact that the reduction is obtained 
only for certain combinations of to and c is a consequence of the discrete character of r.) 
Thus, for TO = 50%, the estimator (1.2) requires r = 18 in order to globally guarantee a 
90% confidence level, whereas r = 17 suffices for the estimator (3.6); furthermore, this is 
the lowest required r that can be achieved by any estimator. 

Comparing the attainable region shown in Figure 1(a), that is, the region above the 
solid curve, with the corresponding region in Figure 1 of Mendo and Hernando (2008a), 
they are seen to have similar shape and size, except that for small to, the boundary curve 
is slightly higher in Figure 1(a). Thus, the applicability of the estimator (3.6) is similar 
to that of (1.2) (while achieving better performance). 

Another interesting particularization is /ii = 1 + to, /i2 = 1/(1 — to), < 7ti < 1, which 
corresponds to requiring that the absolute error |p — p| does not exceed a fraction to of the 
true value p. The results for this case can be compared with those of Mendo and Hernando 
(2008a) and Chen (2007). For to = 40% with a globally guaranteed confidence level of 
90%, the estimator (1.2) requires r = 17 (Mendo and Hernando (2008a), Proposition 1). 
The results of Chen (2007), Theorem 2, give a sufficient value of r = 46 for the estimator 
(1.3). On the other hand, from Theorem 3, the estimator (3.6) only requires r = 16; 
moreover, it is assured that any other estimator requires at least this r to guarantee 
(either globally or asymptotically) the same confidence for the to under consideration. 

5. Proofs 

The following notation is introduced for convenience: 

r-iexp(-^) 

W = — -( TTi — > (5-1 

(r-1)! 

ni = [0/(p^i) - d], n2 = \St-lJ.2/p-d\. (5.2) 

Proof of Proposition 1. Let q denote lim inf p_i.po c(p) and assume that q < cq. For 
e = (co — Ci)/2, the definition of limit inferior implies that there exists p^ € (pi,P2) such 
that c(pe) < Ci + £ < Cq and thus the estimator does not guarantee the confidence level 
Co in {pi,p2). This establishes the result. □ 

Lemma 1. For all t>0, the function (f> defined in (5.1) satisfies 

(r- l)'-icxp(-r + l) , , 

^<'P^)<- ^7^-1^1 ^<1- (5-3) 
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Proof. The first inequality in (5.3) is obvious. Let Q,- = (i — 1)''"-'^ exp(— r + l)/(r — 1)!. 
Maximizing (j){t) with respect to t, it is seen that (j>{t) < Qr- Since Qr+i/Qr ~ (1 + !/('' ~ 
l))'-iexp(-l) < 1 and Q3 = 2exp(-2) < 1, it follows that < 1 for aU r > 3. □ 

Proof of Proposition 2. Consider g G and let il = lim„^oo Given e > 0, 
there exists such that \ng{n) — fij < e for all n > n^, that is, {il — e)/n < g{n) < 
{fl + e)/n. Using this, the confidence c{p) = P[p/ ^2 < g{N) < PMi] can be bounded for 
p < (n — e)/(/xine) as 

c{p) > P[\{n + e)/ip^ll)] <N<lin- e)^i2/p\l (5.4) 
cip) < P[\{n - e)/(pA^i)l <N<lin + e)n2/p\]. (5.5) 

Let b{n,p;i) denote the binomial probability function with parameters n and p evalu- 
ated at i. From the relationship between binomial and negative binomial distributions, 
(5.4) is written as 

r—l r—1 

c{p) > b{\{n + e)/{p^l,)^ - *) - ^ b{[{n - e)^i2/p\,P; »). (5.6) 

i=0 i=0 

According to the Poisson theorem (Papoulis and Pillai (2002), page 113), the right-hand 
side of (5.6) converges to j{r, {il — e)/^2) — + e)/Mi) as jn- 0. This implies that 

liminf c{p) > 7(r, {n - e)fi2) - j{r, + e)|^il). (5.7) 

p->0 

Since dj{r,t)/dt = 4>{t), Lemma 1 establishes that < d^{r,t)/dt < 1. Using this, (5.7) 
yields lim infp_j.o c(p) > c — {112 + I/mi)^- Taking into account that this holds for all 
£ > 0, it follows that lim infp_j,o c(p) > c. Applying similar arguments to (5.5) gives 
limsupp_j.o c(p) < c. Thus, limp_j.oc(p) exists and equals c. □ 

Lemma 2 (Abramowitz and Stegun (1970), inequality (4.1.33)). t/{l + t)< 
log(l + t)<t for t > — 1, with equality if and only if t ~0. 

Lemma 3. Let (pk) be a positive sequence which converges to 0. For any vi, V2 with 
V2> vi> 0, the sequence of functions {4>u) defined as 

converges uniformly to 4>{i'), given by (5.1), in the interval [vi,i'2]- 

Proof. Since the sequence (pk) is positive and converges to 0, there exists fci such that 
Pk < minlz^i/ (r—l), 1} for k > ki. Thus, 4ik{v) > for v Cz [vi,V2\, k> fci and, therefore, 

|0,(^) - 0(^)1 = min{0fc(^), 0(:.)} (^maxj^, - l) . (5.9) 
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Lemma 1 implies that min{(/)fe(z/), < 1. On the other hand, 



(t)k{v) 

Substituting into (5.9), we have 

\(j)k{v) - (j){v) \ <cxp 



: exp 



log 



log 



1. 



(5.10) 



From (5.1) and (5.8), it follows that, assuming v € [1^1,1^2] and k>ki, 

r-l 



log 



< 



glog(^l-^) +|(^il-,^log(l-p,) + . 



(5.11) 



Using Lemma 2, the first term in the right-hand side of (5.11) is bounded as follows: 



V 



1=1 



Wk 



i=l 



(5.12) 



As for the second term, since v G [1^1, 1^2], there exists ^2 > fci such that < min{i^i/r, 1} 
for fc > k-i- Thus, (i^/pfe — r) log(l — p^) < 0, whereas v>^. FromLemma2, — pfe/(l— p^) < 
log(l — Pfe) < — Pfe. It follows that 



r] log(l -pfc) + 

Vk 



< max 



r ]Pk + v 

Pk 



Pk 



Pk 



1 -J3fe 



(5.13) 



max<^ r, >pk < max{r, V2}- 

l 1 - Pfc I 1 



Pk 



Pk 



Combining (5.10)-(5.13) yields the following bound, valid for all ly G [i^i, 1^2], k > k2: 



|0fc(i/) - ^{y) \ < exp 



E 



max{r, 1/2} 



V - ipk l-Pk 



\Pk 



1. 



(5.14) 



The right-hand side of (5.14) tends to as — > 00. Therefore, sup^gj^.^^ j,^] \(f>k{'^) — 
also tends to 0, which establishes the result. □ 



Proof of Theorem 1. For a G (0, 1], let the sequence (pk), fc G N be defined as pk = 
aM-'', with M given by (2.4), and let the sequence of intervals (/fc) be defined as 
Ik = (Pfc/M2jPfeMi]- Consider also the sequence (/fc), where fk is the probability function 
of N with parameters r and pk- From (2.1) and (5.8), 



fkin) =Pk(f>k{npk)- 



(5.15) 
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To facilitate the development, it is convenient to first analyze non-randomized estima- 
tors, and then generalize to randomized estimators. Given a non-randomized estimator 
specified by g £ J", let the sequence of sets (Sk) be defined such that n G Sk if and only if 
g{n) G Ik- Since the intervals Ik are disjoint for different k, the sets Sk are also disjoint. 
Let the function a be defined such that <j{n) = fc if and only if n € Sk, with a(n) = 
(or an arbitrary negative value) if n ^ Sk for all fc G N. Thus, a gives the index k of the 
interval Ik that g associates to each n, if any. The function cr is determined by g; and, 
for a given no, (T(rio) can be modified without affecting the rest of values cr(n), n ^ uq 
by adequately choosing g{no). 

For the considered non-randomized estimator, let Ck denote the probability that p lies 
in Ik whcnp^pfe, i.e. Ck^Y^nes^ fki^)- Further, let C{h,II) = Y!kth~^ Ck for h, H en 
arbitrary. Defining 

s{n) = { if n e S'/i U • • • U Sh+h-i, 

\ 0, otherwise, 

the sum C{h,II) can be expressed as X]^r'^("')- follows that 

oo 

C{h,H)<y] max fk{n), (5.16) 

ke{h,....h+H~l} 

n—r 

which holds with equality if the estimator satisfies 

cr(ri) = arg max fkM for all ri g N, n > r, (5-17) 

ke{h,...,h+H~l} 

where, if the maximum is reached at more than one index fc, the argmax function is 
arbitrarily defined to give the lowest such index. 

As for randomized estimators, consider an arbitrary function G G T-jz, that for 
each n specifies n„, the distribution function of p conditioned on N — n. Let I'^ = 
{~Go,Ph+H-i/ 1^2] U (pft/ii, 00). Conditioned on = n, let 7r„jt and tt^j respectively denote 
the probabilities that p is in Ik and in P . Obviously, 

7I"„,;, H h -KnJi+H-l + < = 1. (5.18) 

For N ~ n, the numbers 7r„^;i, . . . , 7Tn,h+H-i, indicate how the conditional probability 
associated to all possible values of p (that is, 1 in total) is divided among Ih, . . . , Ih+H-i, 
P . For a given no, any combination of values '!rnQ,h, ■ ■ ■ ,T^no,h+H-i,T^no allowed by (5.18) 
can be realized, without affecting other values 7r„ fc, Trjj for n^UQ, by adequately choosing 
the distribution function n„Q . Defining Ck and C(/i, H) as in the non-randomized case, 
the former is expressed as 



00 

Ck = y^7r„^fc/fc(n). 

n—r 



(5.19) 
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Since all terms in (5.19) are positive, the series converges absolutely, and thus C{h,H) 
can be written as 

h+H-l oo h+H-1 

c{h,H)= J2 cfc = E E ^".fe/fcW- 

k=h n=r k=h 

It is evident from this expression that C{h,H) is maximized if, for each n, the values 
T^n.k, T^n choscu as 7r„^; = 1 for Z = argmaxfc fkin), 7Tn,k =0 for k ^ I, Trjj = 0; and 
the resulting maximum coincides with the right-hand side of (5.16). Thus the inequality 
(5.16), initially derived for non-randomized estimators, also holds for the general class 
of randomized estimators. This implies that for any randomized estimator there exists a 
non-randomized estimator that attains the same or greater C{h,H). 

Let the function g € he defined as g{n) = fl* /n, and consider the non-randomized es- 
timator specified by this function. The sets Sk and function a associated to this estimator 
will be denoted as Sk and a respectively. It is seen that 

Sk = [n*/{jiipk), n*ti2/Pk) n {r, r + 1, r + 2, . . .}. (5.20) 

Let the sequence of functions (fk) be defined as fkin) =Pk4>{npk). It is readily seen 
that the equation fk{t) = fk+i{t) has only one solution, given as t = D,*H2/Pk- Similarly, 
fk-i{t) = fk{t) has the solution t = ft* /{jiipk). Taking into account that the functions 
fk are unimodal, this implies that /fc(n) > fk-i{n) and fk{n) > fk+i(n) for n £ Sk- An 
analogous argument shows that /^(n) > fk-i{n) and fkin) > fk+i{n) for Sk and i>2. 
Therefore, the function a associated to g satisfies a modified version of (5.17) in which 
fk is replaced by fk-, that is, (j)k in (5.15) is replaced by its limit 0. In the following, using 
the fact that the difference between (j>k and is small for large k, the estimator defined 
by g will be used to derive from (5.16) a more explicit upper bound on C{h, H). Since the 
sum C{h,H) attained by any estimator is equalled or exceeded by some non-randomized 
estimator, it will be sufficient to restrict to the class of non-randomized estimators. 

Consider an arbitrary non-randomized estimator defined by g e with its correspond- 
ing function a. Differentiating (2.1) with respect to p, it is seen that df{n)/dp is positive 
for p <r/n and negative for p > r/n. According to (5.20), 

n*/{Hin) <pk <^l*^i2/n forneSk- (5.21) 

Using Lemma 2, it stems from (2.5) that 

n*/fll<r<n*^i2■ (5.22) 

The first inequality in (5.21) and the second in (5.22) imply that pk-i = Mpk > il* 112/n > 
r/n for n G Sk- Thus df{n)/dp < for p > pk-i- This implies that fk+i(n) < fk-i(n) for 
n G Sk and i < —2. It follows that if cr^no) < <7{nQ) — 1 for a given ng, the sum C(/i, H) 
could be made larger by modifying the value g{no) so as to attain cr(no) = ^(ng) — 1; 
unless (T(no) = h + H — I, in which case modifying gino) cannot make C{h,H) larger. 
Analogously, the second inequality in (5.21) and the first in (5.22) yield fk+i{n) < fk+i{n) 
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for nG Sk and i>2. Therefore, if (T(no) > (T(no) + 1 for some no, C{h, H) could be made 
larger by modifying gin^), miless (T(no) = h. 

According to the above, in order to obtain an upper bound on C{h,H), it suffices 
to consider non-randomized estimators such that cr{n) G {d'{n) — l,a{n),a{n) + 1} for 
n G 5/1 U • • • U Sh+H-i, ""(n) = /i for n < / {^iph)] and cr{n) ~ h + H — 1 for n> 
\^*IJL2/ph+H-i\- Thus, from (5.16), 



h+H-i \ir /(^llPH)^-l 

C{h,H)< max A+,(n)+ ^ /,(n) 



+ J2 h+H^i{n) (5.23) 

n=\Q.* ^l2/ph+H-l^ 



h+H-1 



fe='i TieSk 

Let vi = = ^^*M2i and consider e > 0. From Lemma 3, there exists fci such 

that \4>ki'^) — 4'i^)\ < £ for G ['^ij ^^2], > ^"i- In addition, (5.20) implies that npk € [i^i, 2^2] 
for nG Sk- From these facts and (5.15) it stems that \fk{n)/pk — 4'{npk)\ < e for n £ Sk, 
k>ki. Thus, for fc > /ci + 1 

max fk+i{n) < max [pfe+j(0(npfc+i) + e)] 
ie{-i,o,i} je{-i,o,i} 

(5.24) 

< max {pk+i4>{npk+i)) + epk-i- 

iG{-l,0,l} 

As previously shown, fk{n) ~ Pk4'{'^Pk) equals or exceeds fk-i{n) and fk+i{n) for n G 5^. 
Therefore, from (5.24) 

max fk+i{n) <pk(i){npk)+epk^i. (5.25) 
ie{-i,o,i} 

Substituting (5.25) into (5.23), 

h+H-l 

C{h,H)< J2 ^ (Pfc0(^Pfe)+£Pfc-i) + 2. (5.26) 

k=h neSk 

The number of elements in the set Sk is less than r2*(/i2 — l/yUi)/Pfc + li according to 
(5.20). From (2.5), this upper bound equals r\ogM/pk + 1. Using this in (5.26), and 
taking into account that pk~i < 1, 

h+H-l 

C{h,H)< Y E J'fc'^(^^'fc)+^^(^^^log^^ + l) + 2- (5-27) 

k=h neSk 
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The term X^neSfc -P's'^("'-Pfc) '^^ (5-27) converges to /j^.y'^'^^ (j){iy)diy ~ c* as /c — > oo. Thus, 
for the considered e, there exists k2 such that for all k> k2 



^ Pk<l>{npk) - c* 



<e. (5.28) 



Consequently, defining fco = max{fci + 1, fc2}, inequalities (5.27) and (5.28) imply that for 
all h> ko, for all iJ, and for a G (0, 1], 

C{h, H) < H{c* + e) + eH{rM log M + 1) + 2 

(5.29) 

= H{c* +eP) + 2, 

where P = rA/ log i\/ + 2 > 0. 

Since the minimum of a set cannot be larger than the average of the set, from (5.29) 
it follows that there is some k^ £ {h, . . . ,h + H — 1} such that c^g < C{h, H)/H < c* + 
eP + 2/H. 

Using the foregoing results, the bound lim inf p_^o c(p) < c* for an arbitrary estimator 
can be established by contradiction. Assume that there is some estimator, defined by 
G G J-Tz, such that liminfp^o c{p) ^ c* + d with d> 0. This means that for any e' > 
there exists po such that c{p) > c* + d — e' for all p <pa. Thus taking e' ^ d/3, there is 
Po such that 

c{p)>c*+2d/3 foranp<po- (5.30) 

On the other hand, taking e ~ d/{6P), H — [12/(i], and with a G (0,1] arbitrary, the 
result in the preceding paragraph assures that for any h not smaller than a certain fco 
(which depends on the considered e) there exists k-^ G {/i, . . . , /i + — 1} with 

Cfc3 <c*+eP + 2/i7<c*+d/3. (5.31) 

Let h be selected such that h > max{fco7 — logPo/ logM}. For each k — h, . . . ,h + H —1, 
let Xk denote the (possibly empty) set of all points x G M such that, for the considered 
estimator and for p = pk, the probability that p equals x is at least d/Z. The number of 
points in Xk cannot exceed [3/dJ , for otherwise the sum of their probabilities would be 
greater than 1. The set Xk is determined by fk and G (or by fk and g, in the case of a 
non-randomized estimator defined as p = g{N)). 

Let a G (0, 1] be chosen such that 

Pk/m^^Xk ior a\\ k = h,...,h + H -1. (5.32) 

Such a necessarily exists because (5.32) excludes only a finite number of possible values 
from (0,1]. This choice of a assures that for p = pk, k = h, . . . ,h + H — 1, the probability 
that p equals Pk/ fJ'2 is smaller than d/3. Thus c{pk) < Ck + d/3 for k = h, . . . ,h + H — 1, 
which, together with (5.31), gives 



cipks) <c* +2d/3. 



(5.33) 
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On the other hand, since > h, from the choice of h it fohows that 

Pk, - aM"'"' < aM-^' < aA&sP"/^°s^'' = apo < Po- 

This impUes, according to (5.30), that c{pk^) > c* + 2d/i, in contradiction with (5.33). 
Therefore hniinfp_>.o c(p) < c* . 

It has been shown that, for any estimator, hm inf p_).o c(p) cannot exceed c*. In addi- 
tion, any non-randomized estimator defined by a function g G J-"n with hm„_i.oo ng{n) = 
ri* achieves hm inf p_).o c(p) = hmp_>.oc(p) = c*. Therefore, c* is the maximum of 
hminfp_i.o c(p) over all randomized or non-randomized estimators. □ 

Proof of Theorem 2. Let Ip{z,w) denote the regularized incomplete beta function: 

/p(z,«;) = — ^ f't^-^i-tr-^dt, 

1 



B{z,w)= / t'~'{l-t)'"-'dt 
Jo 

„ r(z)r(^) 
r(z + w) ■ 

From Abramowitz and Stegun (1970), equation (6.6.4), P[N < n] = Ip{r,n — r + 1) for 
n € N, n>r. The confidence for the estimator (3.2) can thus be written as 

c{p) = P[ni <N<n2]^ /p(r,n2 - r + 1) - /p(r,7ii - r), (5.34) 

where rii and ri2 are given by (5.2) with 17 = 17*. It is easy to show that Ip{z^w) is an 
increasing function of w for w G R. Since ni < fl* /{pfii) — d+1 and n2 > fl* fi2/p — d — 1, 
it follows from (5.34) that 

cip) > C2ip) - diip) (5.35) 

with ci{p) = Ip{r, {I* / (pfii) — r — d+1) and C2{p) ~ Ip{r, VL*p2/p— r — d). 
Expressing Ip{z,w) as (Abramowitz and Stegun (1970), equation (26.5.23)) 

2Fi{z,l~w;z + l;p)p'' 
Ip[z,w) = 



B{z,w)z ' 

2Fi{a,b;c;t) = ^ -— — — — , 

^ (c + j-l)U)j! 

the term C2(p) can be written as 

- ! ^-2^1 ^ + ^ + 1 - ^^*M2/p; r + l;p) (5.36) 

nyr^W [L2IP — r — a) r 
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(»>2/p -d- !)('■) ^ {-ly{^*^l2/p -r-d- 1)(^) 
(r-1)! j-'^ ir+j)j\ ^ 

1 {-ly (r>>2 - (d + i)p) ■ ■ ■ - (rf + r + j)^) 



Similarly, 



1 ^ {^ly{n*/^,, -dp)--- (»v^i - (d + r + j - i)p) 

According to (5.37) and (5.38), both ci[p) and C2(p) can be expressed as power series 
in p: ci{p) = ^"^QUip^ , C2{p) — J2ilo'"iP^- Thus, the right-hand side of (5.35) is also a 
power scries, X]i=o^*P*' '^ith Wi — Vi — Ui. The zero-order coefficient is 



Wo 



1 (-1)^'(»>2)''+J" >fi 

{r + j)il ^ 



From the Taylor expansion of 7(r, i) (Abramowitz and Stcgun (1970), equation (6.5.29)), 

, , 1 ^ (-1)^^+^' 



71 



r(r)^t^ (r + j)j! ' 



the coefficient wq is recognized to be c* , reflecting the fact that the difference between the 
two sides of (5.35) is vanishingly small as p — > 0. In addition, the first order coefficient wi 
coincides with the derivative of the right-hand side of (5.35) evaluated at p = 0. It follows 
that wi > is a sufficient condition for the estimator (3.2) to asymptotically guarantee 
the confidence level c* . 

The coefficient wi ~ vi — ui is computed as follows. The term vi is obtained from 
(5.37) as 



Vi ■ 



1 ~ {-iy+\n*fi2y+'-^Kd + i) + --- + id + r+j)] 



(r-1)!^^ {r+j)jl 



Substituting {d + l)-\ \- [d + r + j) ^ {r + j){r + j + 1 + 2d)/2 into (5.39) and taking 

into account that exp(<) = X]j!lo ^Vj-j have 

r + i + 2d^(-iy+\n*fi2y+^-^ i ^ (-ly (n* ^i2y+^ 



-V -J- j! 2(r-l)!^' ?! 

(5.40) 



(l^>2)''-^exp(-»>2) ,^. . 
^(731)^ A^2-r-l-2d). 



Optimum guaranteed confidence in inverse binomial sampling 



511 



Similarly, 



ui = 2{r-l)\ ^ /^il-r + l- 2d). (5.41) 



From (5.40) and (5.41), and making use of (2.5), 

»Vmi)''^ 

2(r-lV 



= (^Vmi)' ^ [M'-icxp(-»>2)(»>2 - r - 1 - 2d) 



- exp(-f77^ii)(r27/ii - r + 1 - 2d)] 
1 / r log M " 



|^r_bgM^ ' M-"^''-^\-{r + 1 + 2d)/Af + r - 1 + 2d], 



2(r- 1)! V M - 1 

which is positive if (3.3) holds. Thus, (3.2) asymptotically guarantees the confidence c* 
for d as in (3.3). □ 

Lemma 4. For r, no G N with r > 3 and uq < {r — y/r)/p, 

/.no "0 - 1 

/ r-iexp(-pt)dt> ^ (n-l)('-i)(l-p)"-'-. (5.42) 

n=r 

Proof. For rig as given, the sub-integral function in (5.42) is increasing within the inte- 
gration range. Consequently, for (5.42) to hold, it is sufficient that n''"^ exp(— np) > 
(n — 1)('"^^^(1 — p)"^'' for n = r, . . . , [(r — ^/r)/p\ — 1. This condition can be shown 
to be satisfied by means of reasoning analogous to that in the proof of Lemma 1 of 
Mendo and Hernando (2008a), part (i). □ 

Proof of Proposition 3. The confidence for the estimator (3.4) is expressed as C2{p) — 
ci(p), ci{p) = P[N <ni — 1], C2{p) = P[N < 7^2], where ni and 712 are given by (5.2) 
with d = 1. Let = 1 — C2. A similar argument as in the proof of Proposition 2, based 
on the Poisson theorem, shows that limp^o ci (p) = ci and limp_i.o c'2 {p) = c'2 , with ci = 
j{r,H/ III), C2 = 1 — 7(r, 51^2)- Thus, to establish the desired result, it suffices to prove 
that ci{p) < ci and c'2{p) < c'2 for fii, fi2 as in (3.5). 

Regarding C2{p), it is shown in Appendix C of Mendo and Hernando (2006) that P[N < 
[a/p\] > limp^^o — L^/pJ] foi' a> r + y/r. Equivalently, for any 9>1, 

P[N > ld{r + V^)/p\ + 1] < lim P[N > [e{r + V^)/p\ + 1]. 

p— 5.0 

This can be made to correspond to C2(p) < c'2 by taking [fi/Li2/p — Ij = [d{r + \^)/p\ , 
that is, 
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Thus, the inequahty C2(p) < holds if fj.2 is given by (5.43) for some > 1 or, equiva- 
Icntly, if 1^2 > {r + y/r + p)/il. Therefore, it holds for arbitrary p if /i2 satisfies the second 
inequality in (3.5). 

As for ci{p), from (2.1), it can be expressed as 

ni — 1 ^ ni — 1 

c,{p) = = E (" - -P)"-'-. (5.44) 



According to (5.2) with d=l, 



ni<— (5.45) 



and thus 

ci =7(r,f7//ii) 



r j-n/(piii) 

r rni 



> P 



(5.46) 



From (5.44), (5.46) and Lemma 4, it follows that the inequality ci{p) < c\ is satisfied if 

ni < (5.47) 
P 

Using (5.45), it is seen that (5.47) is fulfilled if pi satisfies the first inequality in (3.5). □ 



Proof of Theorem 3. For = fi*, using (2.4) and (2.5), the conditions in (3.5) are 
written as (3.7) and (3.8). The left-hand sides of (3.7) and (3.8) are increasing functions of 
M, whereas the right-hand sides decrease with r. These inequalities can thus be written as 
M > hi{r), M > h2{r), where hi, /12 are decreasing functions. This proves that for each r, 
one of the inequalities implies the other. Furthermore, defining h{r) = min{/ii(r), h2{r)}, 
the allowed range for M is expressed as Af > h{r) and h is a decreasing function. It only 
remains to prove that the limiting condition is (3.7) for r ~ {3,4} and (3.8) for r > 5. 

The left-hand sides of (3.7) and (3.8) are continuous functions of M > 1. Consid- 
ering r as if it were a continuous variable, the right-hand sides are also continuous 
functions. Assume that (3.7) implies (3.8) for a given ri, that is, that the latter is sat- 
isfied when the former holds with equality. Likewise, assume that (3.8) implies (3.7) 
for a given r2. The continuity of the involved functions then implies that there exists 
t€ [min{ri, r2}, max{ri, r2}] such that both (3.7) and (3.8) hold with equality for r — t, 
that is, 

M- 1 _ t + ^/t MlogM _ t + Vi+1 

log A/ " t-1 ' M-1 ^ t ■ ^ ' 
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Multiplying both equalities in (5.48) and substituting into the first yields 

t+Vt+1 2Vt+l 
log^-^^ — = (5-49) 

It is easily shown that (5.49) has only one solution, which lies in the interval (4, 5). Thus, 
one of the two conditions (3.7) and (3.8) is the limiting one for r € {3,4}, whereas the 
other is for r = {5, 6,7,...}. Taking any value from each set, the limiting condition is seen 
to be (3.7) in the former case and (3.8) in the latter. □ 
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